Context Training Training Cross Testing
نویسندگان
چکیده
Table 3: Results of comparing hidden layer sizes (6-context). Training was done on 573 items, using a cross validation set of 258 items. and perhaps even to other markup recognition problems , and (iv) compare the use of the neural net with more conventional tools such as decision trees and Hidden Markov Models. for assistance in nding references and determining the status of related work. Special thanks to Prof. Franz Guenthner for introducing us to the problem. 6 Sharp). Also included in this group are items such as U.S. Supreme Court or U.S. Army, which are sometimes mislabeled because U.S. occurs very frequently at the end of a sentence as well. 22.5% false negative due to an abbreviation at the end of a sentence, most frequently Inc., Co., Corp., or U.S., which all occur within sentences as well. 11.0% false positive or negative due to a sequence of characters including a punctuation mark and quotation marks, as this sequence can occur both within and at the end of sentences. 9.2% false negative resulting from an abbreviation followed by quotation marks; related to the previous two types. 9.8% false positive or false negative resulting from presence of ellipsis (...), which can occur at the end of or within a sentence. 9.9% miscellaneous errors, including extraneous characters (dashes, asterisks, etc.), un-grammatical sentences, misspellings, and parenthetical sentences. The results presented above (409 errors) are obtained when both t 0 and t 1 are set at 0:5. Adjusting the sensitivity thresholds decreases the number of punctuation marks which are mislabeled by the method. For example, when the upper threshold is set at 0:8 and the lower threshold at 0:2, the network places 164 items between the two. Thus when the algorithm does not have enough evidence to classify the items, some mislabeling can be avoided. 5 We also experimented with diierent context sizes and numbers of hidden units, obtaining the results shown in Tables 2 and 3. All results were found using the same training set of 573 items, cross-validation set of 258 items, and mixed-case test set of 27,294 items. The \Training Error" is one-half the sum of all the errors for all 573 items in the training set, where the \error" is the diierence between the desired output and the actual output of the neural net. The \Cross Error" is the equivalent value for the cross-validation set. These two error gures give …
منابع مشابه
Effects of concentric and eccentric training on muscle strength, cross-sectional area, and neural activation.
We compared the effects of concentric (Con) and eccentric (Ecc) isokinetic training on quadriceps muscle strength, cross-sectional area, and neural activation. Women (age 20.0 +/- 0.5 yr) randomly assigned to Con training (CTG; n = 16), Ecc training (ETG; n = 19), and control (CG; n = 19) groups were tested before and after 10 wk of unilateral Con or Ecc knee-extension training. Average torque ...
متن کاملfor ” Multi View Registration for Novelty / Background Separation ”
Figure 1 shows 7 examples of one of the places in our data set. Despite the fact that we had more than these number of images per place, in this paper, we did not use all of them: we used 4 images for training/cross validation and 3 for testing purposes. Note that using 4 images for training means that we have 4 × ( 6 5 ) = 24 different choices for training using 5 reference images, and similar...
متن کاملTransfer learning for cross-company software defect prediction
0950-5849/$ see front matter 2011 Elsevier B.V. A doi:10.1016/j.infsof.2011.09.007 ⇑ Corresponding author. Tel.: +86 028 61830557; fa E-mail addresses: [email protected] (Y. Ma), g [email protected] (X. Zeng), [email protected] Context: Software defect prediction studies usually built models using within-company data, but very few focused on the prediction models trained with cross-company da...
متن کاملTheoretical Analyses of Cross-Validation Error and Voting in Instance-Based Learning
This paper begins with a general theory of error in cross-validation testing of algorithms for supervised learning from examples. It is assumed that the examples are described by attribute-value pairs, where the values are symbolic. Cross-validation requires a set of training examples and a set of testing examples. The value of the attribute that is to be predicted is known to the learner in th...
متن کاملContextual constraints on memory retrieval at six months.
In 3 experiments, 6-month-old infants learned to move a mobile by kicking and were tested 1 to 21 days later for retention of the newly acquired memory as a function of the training and testing contexts. In Experiment 1, decreasing the relative distinctiveness of the training and testing context did not impair retrieval of the newly acquired memory. In Experiment 2, however, testing in a differ...
متن کاملThe optimal time of day for training during Ramadan: A review study
Literature concerning the effects of Ramadan fasting on sports performance presents conflicting results. In this context, some studies reported a significant impairment of sports performance during the month of Ramadan. However, other studies suggested that Ramadan fasting has no significant effect on physical performance. The discrepancies between the studies could be explained by time-of-day...
متن کامل